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A METHOD FOR PLACEMENT OF DATA FOR VISUALIZATION OF 
MULTIDIMENSIONAL DATA SETS USING MULTIPLE PIXEL BAR CHARTS 

FIELD OF INVENTION 

The present invention relates to the field of information visualization. 
Specifically, the present invention relates to a method for placement of data for 
visualization of multidimensional data sets using multiple pixel bar charts. 

BACKGROUND OF THE INVENTION 

Pixel-oriented techniques allow for the presentation of large amounts of data 
having detailed information. To date, many visualization applications utilize of 
pixel-oriented techniques, such as the spiral technique, the recursive pattern technique 
and the pixel bar chart technique. The pixel bar chart technique is for presenting the 
data values directly rather than aggregating the data into a few data values. The 
approach is to represent each item by a single pixel in the bar chart. 

However, for an analysis of large volumes of data (e.g., large volumes of e- 
commerce transactions), the visualization of complex data is not sufficient. In order 
to optimally view and interpret the data it is necessary to optimally place the data. It 
is not obvious what an optimal placement of pixel should be. The optimal placement 
of data associated with visualization of massive multi-attribute data presents 
particular problems. 
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For one, in large data sets (e.g., dense data sets), limitations exist on pixel 
visualizations that are capable of showing large amounts of data on a value by value 
basis without aggregation. Current techniques do not offer optimal arrangement of 
5 data for detecting close relationships among the data. Also, current techniques do not 
offer an optimal arrangement of data for representing multiple attributes. 

Current pixel visualization techniques also do not present data such that 
similar data records are placed close to each other. It is desirable to place similar 
records close to each other to fully appreciate the information presented by the data. 
For example, the detection of trends among data requires that similar records be placed 
in close proximity. 

Furthermore, current pixel visualization techniques do not arrange data in a 
logical sequence. Currently techniques provide insufficient placement to resolve the 
locality and ordering constraints. As such, data is not categorized, and is more 
difficult to understand. 

Also, limitations exist such that current pixel visualization techniques are 
20 restricted with respect to the number of the data attributes that may be used. Current 
techniques are typically limited to either four or six attributes. Since it is not 
uncommon to have data sets comprised of greater than 10 attributes, this limitation 
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can severely restrict optimal viewing and interpretation of the data. 

Accordingly, a need exists for a method for optimal placement of data 
associated with visualization of massive multi-attribute data in pixel bar charts. A 
need also exists for a method for presenting an overview of the data while also 
presenting the detailed information for each data item. A need also exists for a method 
for presenting data in a manner for detecting close relationships and trends among the 
data. A need also exists for a pixel visualization technique that presents data in a 
categorized manner that is easy to understand. 
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SUMMARY OF THE INVENTION 

A method for placement of data for visualization of multidimensional data sets 
using multiple pixel bar charts is presented. Data is received comprising a plurality of 
records, each record having a plurality of attributes. From the plurality of attributes, 
a set of attributes is determined for placement of the plurality of records in a 
graphically displayable array comprising a plurality of data points, each data point 
representing one record of the plurality of records. In one embodiment, the 
graphically displayable array is a pixel bar chart. The plurality of records are arranged 
to construct the pixel bar chart for presenting the data in a format for detecting 
relationships between the plurality of records. The present invention provides a pixel 
visualization technique having a method of optimal data placement. 

In one emboaiment, the plurality of records are sorted by a first dividing 
attribute corresponding to horizontal axis, and partitioned into groups according to the 
first dividing attribute. In one embodiment, the plurality of records are then sorted by 
a second dividing attribute corresponding to vertical axis, and partitioned into sub- 
groups according to the second dividing attribute. The records of each of each sub- 
group are then sorted according to a first ordering attribute and a second ordering 
attribute. A visual indicator (e\g., a color) is then applied to each record according to a 
visual indicator attribute. \ 
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The present invention provides a method of efficient pixel placement for 
producing dense pixel visualizations that are capable of showing large volumes of 
multi-dimensional data sets and provides a heuristic placement method to resolve 
locality and ordering constraints. The present invention allows for the partition of the 
data set according to the x-axis and y-axis and for placement of pixels of each partition 
in the corresponding regions according to the x-and y- ordering. As such, the present 
invention provides a method for generating a pixel bar chart for detecting close 
relationships and trends among the data that is categorized and easy to understand. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and form a part of this 
specification, illustrate embodiments of the invention and, together with the 
description, serve to explain the principles of the invention: 

FIGURE 1 is a flowchart diagram illustrating steps in a process for generating 
a pixel bar chart in accordance with an embodiment of the present invention. 

FIGURE 2 is a flowchart diagram illustrating steps in a process for placement 
of data for visualization of multidimensional data sets using multiple pixel bar charts 
in accordance with an embodiment of the present invention. 

FIGURE 3 a is a block diagram illustrating the partitioning of records into 
groups accordingly in accordance with an embodiment of the present invention. 

FIGURE 3b is a block diagram illustrating the sorting of a plurality of records 
within a group by a second dividing attribute and partitioning the records into sub- 
groups accordingly in accordance with an embodiment of the present invention. 

FIGURE 3 c is a block diagram illustrating the arranging of a plurality of 
records within a sub-group by a first ordering attribute and a second ordering attribute 
in accordance with an embodiment of the present invention. 
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FIGURE 4 is a flowchart diagram illustrating steps in a process for pixel 
placement within a sub-group of a pixel bar chart in accordance with an embodiment 
of the present invention. 

FIGURE 5 is an illustration of an exemplary sub-group 500 in accordance with 
one embodiment of the present invention. 
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DETAILED DESCRIPTION 

In the following detailed description, for purposes of explanation, numerous 
specific details are set forth in order to provide a thorough understanding of the 
present invention. However, it will be apparent to one skilled in the art that the 
present invention may be practiced without these specific details. In other instances, 
well-known structures and devices are not described in detail in order to avoid 
obscuring aspects of the present invention. 

Some portions of the detailed descriptions which follow are presented in terms 
of procedures, steps, logic blocks, processing, and other symbolic representations of 
operations on data bits within a computer memory. These descriptions and 
representations are the means used by those skilled in the data processing arts to most 
effectively convey the substance of their work to others skilled in the art. A 
procedure, computer executed step, logic block, process, etc., is here and generally 
conceived to be a self-consistent sequence of steps of instructions leading to a desired 
result. The steps are those requiring physical manipulations of data representing 
physical quantities to achieve tangible and useful results. It has proven convenient at 
times, principally for reasons of common usage, to refer to these signals as bits, 
values, elements, symbols, characters, terms, numbers or the like. 

Portions of the present invention are comprised of computer-readable and 
computer executable instructions which reside, for example, in computer-usable media 
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of a computer system. It is appreciated that the present invention can operate within 
a number of different computer systems including general purpose computer systems, 
embedded computer systems, and stand alone computer systems specially adapted 
for controlling automatic test equipment. 

Figure 1 is a flowchart diagram illustrating steps in a process 100 for 
generating a pixel bar chart in accordance with an embodiment of the present 
invention. Steps of process 100, in the present embodiment, may be implemented 
with any computer languages used by those of ordinary skill in the art. In one 
embodiment, process 100 is for graphically presenting and visually mining large 
volumes of data using a graphically displayable array. 

In one embodiment, the graphically displayable array is a pixel bar chart. In 
one embodiment, each record is represented by one unique pixel of a display. In 
another embodiment, each record is represented by a data point of a display, wherein 
the data point comprises a plurality of pixels. 

At step 1 10 of process 100, data comprising a plurality of records is received. 
Each record comprises a plurality of attributes, wherein each attribute corresponds to 
a particular item of information of the record. For example, a consumer electronics 
business has a record for each order the business handles. In this example, each record 
may have attributes corresponding to the order number, the price of the order, the 
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customer identification number (ID), and the quantity of items ordered, as well as 
other attributes corresponding to customer data. It should be appreciated that each 
attribute of each record represents a value of the represented data item. As such, 
references to an attribute (e.g., first dividing attribute) with respect to a record are 
intended to refer to the corresponding value for that attribute of the record. 

At step 120, a set of attributes is determined from the plurality of attributes, 
wherein the set of attributes is for placement of the plurality of records in the 
graphically displayable array. In one embodiment, a first dividing attribute, a second 
dividing attribute, a first ordering attribute, a second ordering attribute and a visual 
indicator attribute are selected. 

In one embodiment, the first dividing attribute corresponds to a horizontal axis 
and the second dividing attribute corresponds to a vertical axis of the graphically 
displayable array. The first dividing attribute and second dividing attribute are for 
partitioning the data into groups and sub-groups. It should be appreciated that the 
first dividing attribute and the second dividing attribute are distinct attributes selected 
from the plurality of attributes. However, each of the first dividing attribute and the 
second dividing attribute may be the same as the first ordering attribute, the second 
ordering attribute and the visual indicator attribute. 
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In one embodiment, the first ordering attribute corresponds to a horizontal axis 
and the second ordering attribute corresponds to a vertical axis of the graphically 
displayable array. The first ordering attribute and second ordering attribute are for 
placing the individual records within each of the groups and sub-groups. It should be 
appreciated that the first ordering attribute and the second ordering attribute are 
distinct attributes selected from the plurality of attributes. However, each of the first 
dividing attribute and the second dividing attribute may be the same as the first 
dividing attribute, the second dividing attribute and the visual indicator attribute. 

In one embodiment, the visual indicator attribute corresponds to a visual 
indicator. In one embodiment, the visual indicator is a color for pixel coloring. It 
should be appreciated that the visual indicator attribute is selected from the entire 
plurality of attributes. As such, the visual indicator attribute may be the same as any 
of the other selected attributes. 

At step 130 of process 100, the plurality of records are arranged to construct 
the graphically displayable array for presenting said data in a format for detecting 
relationships between the plurality of records. In one embodiment, the present 
invention provides a process for placement of data for visualization of 
multidimensional data sets using multiple pixel bar charts in accordance with an 
embodiment of the present, as described in process 200 of Figure 2. 
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The present invention provides a method of efficient pixel placement for 
producing dense pixel visualizations that are capable of showing large volumes of data 
sets and provides a heuristic placement method to resolve locality and ordering 
constraints. The present invention allows for the partition of the data set according to 
5 the x-axis and y-axis and for placement of pixels of each partition in the corresponding 
regions according to the x-and y- ordering. As such, the present invention provides a 
method for generating a pixel bar chart for detecting close relationships and trends 
% among the data that is categorized and easy to understand. 

m 
ru 

K P 10 Figure 2 is a flowchart diagram illustrating steps in a process 200 for 

Ey 

I placement of data for visualization of multidimensional data sets using multiple pixel 

O bar charts in accordance with an embodiment of the present invention. Steps of 

^ process 200, in the present embodiment, may be implemented with any computer 

languages used by those of ordinary skill in the art. In one embodiment, process 200 
15 is for graphically presenting and visually mining large volumes of data using a 

graphically displayable array. 

At step 210 of process 200, the plurality of records are sorted by the first 
dividing attribute. In one embodiment, the first dividing attribute is selected for 
20 partitioning the plurality of records (e.g., the data) into groups. In one embodiment, 
the. groups correspond to a partitioning of the screen of a display according to the 
horizontal axis (e.g., x-axis). 
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At step 220, the records are partitioned into groups according to the first 
dividing attribute. In one embodiment, the first dividing attribute corresponds to the 
vertical axis (D x ). 

Figure 3a is a block diagram of a graphically displayable array 300 (e.g., a pixel 
bar chart) as shown on a display illustrating the partitioning of records into groups 
306 accordingly in accordance with an embodiment of the present invention. 

Graphically displayable array 300 has horizontal axis (e.g., x-axis) 302 and 
vertical axis (e.g., y-axis) 304 and comprises a plurality of data points (e.g., pixels). 
The records are sorted by the first dividing attribute, which corresponds to horizontal 
axis 302. The records are divided into groups 306 according to each record's value of 
the first dividing attribute. 

At step 230, the records are partitioned into sub-groups according to the 
second dividing attribute. In one embodiment, the second dividing attribute 
corresponds to the vertical axis (D y ). It should be appreciated that in an embodiment 
of the present invention, no attribute is specified for the second dividing attribute. 

Figure 3b is a block diagram of a graphically displayable array 320 (e.g., a pixel 
bar chart) as shown on a display illustrating the partitioning of records within a group 
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306 into sub-groups 308 accordingly in accordance with an embodiment of the present 
invention. 

Graphically displayable array 320 has horizontal axis (e.g., x-axis) 302 and 
vertical axis (e.g., y-axis) 304 and comprises a plurality of data points (e.g., pixels). 
The records are divided into groups 306 according to each record's value of the first 
dividing attribute. Within each group 306, the records are divided into sub-groups 308 
according to each record's value of the second dividing attribute. 

At step 240, the records of each sub-groups (or group if no second dividing 
attribute is selected) are arranged and placed according to the first ordering attribute 
and the second ordering attribute. In one embodiment, the arrangement and placement 
of pixels is conducted according to a process for pixel placement within a sub-group 
of a pixel bar chart as described in process 400 of Figure 4. In one embodiment, the 
first ordering attribute corresponds to the horizontal axis (OJ and the second ordering 
attribute corresponds to the vertical axis (O y ). 

Figure 3c is a block diagram of graphically displayable array 340 (e.g., a pixel 
bar chart) as shown on a display illustrating the arranging of a plurality of records 
within a sub-group 308 by a first ordering attribute and a second ordering attribute in 
accordance with an embodiment of the present invention. 
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Graphically displayable array 340 has horizontal axis (e.g., x-axis) 302 and 
vertical axis (e.g., y-axis) 304 and comprises a plurality of data points (e.g., pixels). In 
one embodiment, the records are divided into groups 306 according to each record's 
value of the first dividing attribute (e.g., no second dividing attribute is determined). 
The records of each group 306 are placed according to a first ordering attribute and a 
second ordering attribute. Ordering arrows 310 and 312 indicate the direction of 
record placement within each group 306. 

In another embodiment, the records are divided into sub-groups 308 according 
to each record's value of the first dividing attribute and value of the second dividing 
attribute. The records of each sub-group 308 are placed according to a first ordering 
attribute and a second ordering attribute. Ordering arrows 310 and 312 indicate the 
direction of record placement within each sub-group 308. 

At step 250, a visual indicator is applied to each record according to the visual 
indicator attribute. In one embodiment, the visual indicator is a color. In one 
embodiment, non-linear 256 RGB (red-green-blue) color scale is used for determining 
the color for each record. In another embodiment, a non-linear gray-scale color scale is 
used to determine the color for each record. It should be appreciated that any color 
scale or range, both linear and non-linear, may be used in regard to the present 
invention. In one embodiment, the value of the visual indicator attribute is normalized 
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to the range 0 to 1 . In one preferred embodiment, the normalization is nonlinear. 
Then the range 0 to 1 is mapped to a color range. 

In one embodiment, where multiple pixel bar charts are used, different 
5 attributes are the visual indicator attribute in different bar charts, which enables a user 
to relate the different coloring attributes and detect partial relationships among the 
records. In the present embodiment, the dividing and ordering attributes have to stay 
in the same order: 

CO 

«S 3 
: 

!? 10 <D X , D y , 0 X) O yi C > 

□ where D X) D y , 0 Xi O y) C e {A h ...,A k } u_L and DJDy are the dividing attributes in x- 

^ /y-direction, OJOy are the ordering attributes in x-/y-direction, and C is the coloring 

attribute. The element- 1 - is used if no attribute is specified. 

15 

Figure 4 is a flowchart diagram illustrating steps in a process 400 for pixel 
placement within a sub-group of a pixel bar chart in accordance with an embodiment 
of the present invention. The pixel placement within one sub-group requires a two- 
dimensional sort. The present invention provides a heuristic process for pixel 
20 placement by partitioning the data set into subsets according to the first ordering 
attribute (O x ) and the second ordering attribute (O y ,). 
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At step 410, the one-dimensional histograms for O x and O y are determined for 
providing an efficient pixel placement within a single sub-group. The one-dimensional 
histograms for O x and O y are used to determine the 3-quantiles of O x and O r In one 
embodiment, if the sub-group under consideration has extension w x h pixels, the 

l/w,...,(w-l)/w-quantities for the partitioning of O x and the l/h (h-l)/h- quantities 

for the partitioning of O y are determined. The quantiles are then used to determine the 
partitions X h ...,X W of O x and Y h ...,Y h of O r The partitions X h ...,X W are sorted 
according to O y and the partitions Y h .„ y Y h according to O x . 

At step 420, the pixel in the lower-left corner is placed (e.g., position (1,1), of 
the sub-group). In one embodiment, the pixel is placed according to Equation 1 : 



At step 430, all pixels in the lower pixel row and the left pixel row of the sub- 
group are placed. In one embodiment, the lower pixel row is placed according to 
Equation 2: 



Equation 1 




Equation 2 
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r'(U)={^i^.K. a2 }} 



Vz = l..w 



In one embodiment, the left pixel row is placed according to Equation 3: 



Equation 3 



y/ = i..A 



At step 440, the remaining pixels are iteratively placed in the sub-group. In 
one embodiment, the pixels are placed starting from the lower left of the sub-group to 
the and ending at the upper right of the sub-group. In one embodiment, if pixels at 
positions (i-l,j) and (hj-1) are already placed, the pixel at position (i, j) is determined 
according to Equation 4: 



In one embodiment, where the data structures have been placed as described in 
step 410, the pixel to be placed at position can be determined in O(l) time if 



Equation 4 




if Xi r\Y;*0 
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Xi<~\Yj * 0 . If XinYj = 0, the partitions X { and Yj must be iteratively extended and 
d s e ( X, u X l+1 ) n Yj be considered. 

In one embodiment, if this set is still empty, d s e (X, <u u Y j+l ) must 

be considered, and so on, until a data point to be placed is found. It should be 
appreciated that process 400 is efficient due to the data structure used. 

Figure 5 is an illustration of an exemplary sub-group 500 in accordance with 
one embodiment of the present invention. It should be appreciated that sub-group 
500 may be a group in instances where no second dividing attribute is selected. Sub- 
group 500 comprises a plurality of pixels. In one embodiment, the plurality of pixels 
are pixels of a display screen (e.g., a computer monitor). 

Lower-left pixel 502 (e.g., position (1,1)) is placed first according to step 420 
of process 400 (Figure 4). Lower pixel row 504 (e.g., position (/,1)) an d left pixel row 
506 (e.g., position (1,0) are placed next, according to step 430 of process 400. 
Remaining pixels 508 (e.g., position (i+l, i+l),) are then placed in sub-group 500, 
according to step 440 of process 400. 

The preferred embodiment of the present invention a method for placement of 
data for visualization of multidimensional data sets using multiple pixel bar charts, is 
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thus described. While the present invention has been described in particular 
embodiments, it should be appreciated that the present invention should not be 
construed as limited by such embodiments, but rather construed according to the 
below claims. 
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